Obtaining the IMDb movie review dataset
Introducing the bag-of-words model
  Transforming words into feature vectors
  Assessing word relevancy via term frequency-inverse document frequency
  Cleaning text data
  Processing documents into tokens
Training a logistic regression model for document classification
Working with bigger data – online algorithms and out-of-core learning
Summary